94 research outputs found
Identification of candidate regulatory sequences in mammalian 3' UTRs by statistical analysis of oligonucleotide distributions
3' untranslated regions (3' UTRs) contain binding sites for many regulatory
elements, and in particular for microRNAs (miRNAs). The importance of
miRNA-mediated post-transcriptional regulation has become increasingly clear in
the last few years.
We propose two complementary approaches to the statistical analysis of
oligonucleotide frequencies in mammalian 3' UTRs aimed at the identification of
candidate binding sites for regulatory elements. The first method is based on
the identification of sets of genes characterized by evolutionarily conserved
overrepresentation of an oligonucleotide. The second method is based on the
identification of oligonucleotides showing statistically significant strand
asymmetry in their distribution in 3' UTRs.
Both methods are able to identify many previously known binding sites located
in 3'UTRs, and in particular seed regions of known miRNAs. Many new candidates
are proposed for experimental verification.Comment: Added two reference
Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrepresented upstream motifs
BACKGROUND: Transcriptional regulation is a key mechanism in the functioning
of the cell, and is mostly effected through transcription factors binding to
specific recognition motifs located upstream of the coding region of the
regulated gene. The computational identification of such motifs is made easier
by the fact that they often appear several times in the upstream region of the
regulated genes, so that the number of occurrences of relevant motifs is often
significantly larger than expected by pure chance. RESULTS: To exploit this
fact, we construct sets of genes characterized by the statistical
overrepresentation of a certain motif in their upstream regions. Then we study
the functional characterization of these sets by analyzing their annotation to
Gene Ontology terms. For the sets showing a statistically significant specific
functional characterization, we conjecture that the upstream motif
characterizing the set is a binding site for a transcription factor involved in
the regulation of the genes in the set. CONCLUSIONS: The method we propose is
able to identify many known binding sites in S. cerevisiae and new candidate
targets of regulation by known transcription factors. Its application to less
well studied organisms is likely to be valuable in the exploration of their
regulatory interaction network.Comment: 19 pages, 1 figure. Published version with several improvements.
Supplementary material available from the author
Ab initio identification of putative human transcription factor binding sites by comparative genomics
We discuss a simple and powerful approach for the ab initio identification of
cis-regulatory motifs involved in transcriptional regulation. The method we
present integrates several elements: human-mouse comparison, statistical
analysis of genomic sequences and the concept of coregulation. We apply it to a
complete scan of the human genome. By using the catalogue of conserved upstream
sequences collected in the CORG database we construct sets of genes sharing the
same overrepresented motif (short DNA sequence) in their upstream regions both
in human and in mouse. We perform this construction for all possible motifs
from 5 to 8 nucleotides in length and then filter the resulting sets looking
for two types of evidence of coregulation: first, we analyze the Gene Ontology
annotation of the genes in the set, searching for statistically significant
common annotations; second, we analyze the expression profiles of the genes in
the set as measured by microarray experiments, searching for evidence of
coexpression. The sets which pass one or both filters are conjectured to
contain a significant fraction of coregulated genes, and the upstream motifs
characterizing the sets are thus good candidates to be the binding sites of the
TF's involved in such regulation. In this way we find various known motifs and
also some new candidate binding sites.Comment: 22 pages, 2 figures. Supplementary material available from the
author
Correlated fragile site expression allows the identification of candidate fragile genes involved in immunity and associated with carcinogenesis
Common fragile sites (cfs) are specific regions in the human genome that are
particularly prone to genomic instability under conditions of replicative
stress. Several investigations support the view that common fragile sites play
a role in carcinogenesis. We discuss a genome-wide approach based on graph
theory and Gene Ontology vocabulary for the functional characterization of
common fragile sites and for the identification of genes that contribute to
tumour cell biology. CFS were assembled in a network based on a simple measure
of correlation among common fragile site patterns of expression. By applying
robust measurements to capture in quantitative terms the non triviality of the
network, we identified several topological features clearly indicating
departure from the Erdos-Renyi random graph model. The most important outcome
was the presence of an unexpected large connected component far below the
percolation threshold. Most of the best characterized common fragile sites
belonged to this connected component. By filtering this connected component
with Gene Ontology, statistically significant shared functional features were
detected. Common fragile sites were found to be enriched for genes associated
to the immune response and to mechanisms involved in tumour progression such as
extracellular space remodeling and angiogenesis. Our results support the
hypothesis that fragile sites serve a function; we propose that fragility is
linked to a coordinated regulation of fragile genes expression.Comment: 18 pages, accepted for publication in BMC Bioinformatic
MOCVD of AlN on epitaxial graphene at extreme temperatures
The initial stages of metal organic chemical vapor deposition (MOCVD) of AlN on epitaxial graphene at temperatures in excess of 1200 °C have been rationalized. The use of epitaxial graphene, in conjunction with high deposition temperatures, can deliver on the realization of nanometer thin AlN whose material quality is characterized by the appearance of luminescent centers with narrow spectral emission at room temperature. It has been elaborated, based on our previous comprehensive ab initio molecular dynamics simulations, that the impact of graphene on AlN growth consists in the way it promotes dissociation of the trimethylaluminum, (CH3)3Al, precursor with subsequent formation of Al adatoms during the initial stages of the deposition process. The high deposition temperatures ensure adequate surface diffusion of the Al adatoms which is an essential factor in material quality enhancement. The role of graphene in intervening with the dissociation of another precursor, trimethylgallium, (CH3)3Ga, has accordingly been speculated by presenting a case of propagation of ultrathin GaN of semiconductor quality. A lower deposition temperature of 1100 °C in this case has better preserved the structural integrity of epitaxial graphene. Breakage and decomposition of the graphene layers has been deduced in the case of AlN deposition at temperatures in excess of 1200 °C
Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector
A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements
Height and body-mass index trajectories of school-aged children and adolescents from 1985 to 2019 in 200 countries and territories: a pooled analysis of 2181 population-based studies with 65 million participants
Summary Background Comparable global data on health and nutrition of school-aged children and adolescents are scarce. We aimed to estimate age trajectories and time trends in mean height and mean body-mass index (BMI), which measures weight gain beyond what is expected from height gain, for school-aged children and adolescents. Methods For this pooled analysis, we used a database of cardiometabolic risk factors collated by the Non-Communicable Disease Risk Factor Collaboration. We applied a Bayesian hierarchical model to estimate trends from 1985 to 2019 in mean height and mean BMI in 1-year age groups for ages 5–19 years. The model allowed for non-linear changes over time in mean height and mean BMI and for non-linear changes with age of children and adolescents, including periods of rapid growth during adolescence. Findings We pooled data from 2181 population-based studies, with measurements of height and weight in 65 million participants in 200 countries and territories. In 2019, we estimated a difference of 20 cm or higher in mean height of 19-year-old adolescents between countries with the tallest populations (the Netherlands, Montenegro, Estonia, and Bosnia and Herzegovina for boys; and the Netherlands, Montenegro, Denmark, and Iceland for girls) and those with the shortest populations (Timor-Leste, Laos, Solomon Islands, and Papua New Guinea for boys; and Guatemala, Bangladesh, Nepal, and Timor-Leste for girls). In the same year, the difference between the highest mean BMI (in Pacific island countries, Kuwait, Bahrain, The Bahamas, Chile, the USA, and New Zealand for both boys and girls and in South Africa for girls) and lowest mean BMI (in India, Bangladesh, Timor-Leste, Ethiopia, and Chad for boys and girls; and in Japan and Romania for girls) was approximately 9–10 kg/m2. In some countries, children aged 5 years started with healthier height or BMI than the global median and, in some cases, as healthy as the best performing countries, but they became progressively less healthy compared with their comparators as they grew older by not growing as tall (eg, boys in Austria and Barbados, and girls in Belgium and Puerto Rico) or gaining too much weight for their height (eg, girls and boys in Kuwait, Bahrain, Fiji, Jamaica, and Mexico; and girls in South Africa and New Zealand). In other countries, growing children overtook the height of their comparators (eg, Latvia, Czech Republic, Morocco, and Iran) or curbed their weight gain (eg, Italy, France, and Croatia) in late childhood and adolescence. When changes in both height and BMI were considered, girls in South Korea, Vietnam, Saudi Arabia, Turkey, and some central Asian countries (eg, Armenia and Azerbaijan), and boys in central and western Europe (eg, Portugal, Denmark, Poland, and Montenegro) had the healthiest changes in anthropometric status over the past 3·5 decades because, compared with children and adolescents in other countries, they had a much larger gain in height than they did in BMI. The unhealthiest changes—gaining too little height, too much weight for their height compared with children in other countries, or both—occurred in many countries in sub-Saharan Africa, New Zealand, and the USA for boys and girls; in Malaysia and some Pacific island nations for boys; and in Mexico for girls. Interpretation The height and BMI trajectories over age and time of school-aged children and adolescents are highly variable across countries, which indicates heterogeneous nutritional quality and lifelong health advantages and risks
- …